

We claim:

1. An interconnection system for a plurality of processing elements (PEs), each PE having a communications port for transmitting and receiving data and commands, the interconnection system comprising:

5                   inter-PE connection paths; and  
                  a cluster switch connected to said PEs so as to combine mutually exclusive inter-PE connection paths and to thereby substantially reduce the number of communications paths  
10                  required to provide inter-PE connectivity equivalent to that of conventional torus-connected PE arrays.

15                  2. The interconnection system of claim 1, wherein said cluster switch further comprises connections to provide direct communications between transpose PEs.

20                  3. The interconnection system of claim 1, wherein data and commands may be transmitted and received at said communications ports in one of four selectable modes:

25                  a) transmit east/receive west mode for transmitting data to an east PE via the communications port while receiving data from a west PE via the communications port;

                  b) transmit north/receive south mode for transmitting data to a north PE via the communications port while receiving data from a south PE via the communications port;

                  c) transmit south/receive north mode for transmitting data to an south PE via the communications port while receiving data from a north PE via the communications port;  
                  and

30                  d) transmit west/receive east mode for transmitting data to a west PE via the communications port while receiving data from an east PE via the communications port.

35                  4. The interconnection system of claim 3, wherein data and commands may be transmitted and received in a fifth selectable, transpose transmit/receive, mode for transmitting

and receiving between transpose PEs.

5. The interconnection system of claim 4, further comprising a controller and memory system connected to the PEs to simultaneously send control information to each PE's control port and to send data to each PE's data port for loading into registers at each PE.

10. 6. The interconnection system of claim 5, wherein said inter-PE connection paths are selectively switched by path enabling signals.

15. 7. The interconnection system of claim 5, wherein said path enabling signals are generated based upon control information from said controller.

20. 8. The interconnection system of claim 4, wherein each communication port includes B-bit wide transmission and reception paths, where B is an integer greater than or equal to one.

25. 9. The interconnection system of claim 1, wherein each PE is connected to selectively send commands or data over a communications port while receiving data or commands via another communications port, based upon communications instructions received through the control port and decoded in the control logic residing on each of the PEs.

30. 10. The interconnections system of claim 9, wherein the communication instruction is received by the control logic from a controller over said control port.

35. 11. The interconnection system of claim 9, wherein the cluster switch supports an operation wherein the PEs are each simultaneously sending commands or data while receiving commands or data.

12. The interconnections system of claim 11, wherein  
said simultaneous operation is selectively switched such that  
the PEs are each simultaneously sending commands or data while  
receiving data or instructions.

13. An array processor, comprising:  
a plurality of processing elements (PEs), each PE having  
a single inter-PE communications port; and  
inter-PE communications paths connected to provide  
inter-PE connectivity equivalent to that of a conventional  
torus connected array.

14. The array processor of claim 13, further comprising  
inter-PE communications paths connected to provide direct  
transpose PE communications.

15. An array processor, comprising:  
a plurality of processing elements (PEs) arranged in  
clusters;  
inter-PE communications paths connected such that the PEs  
of each cluster communicate in mutually exclusive directions  
with the PEs of each of at least two other clusters; and  
cluster switches connected to multiplex inter-PE  
communications in said mutually exclusive directions.

16. An array processor, comprising:  
N clusters of M processing elements, each processing  
element having a communications port through which the  
processing element transmits and receives data over a total of  
B wires;  
communications paths which are less than or equal to  
(M)(B)-wires wide connected between pairs of said clusters;  
each cluster member in the pair containing processing elements  
which are torus nearest neighbors to processing elements in  
the other

cluster of the pair, each path permitting communications between said cluster pairs in two mutually exclusive torus directions, that is, South and East or South and West or North and East or North and West; and

5       multiplexers connected to combine 2(M)(B)-wire wide communications into said less than or equal to (M)(B)-wires wide paths between said cluster pairs.

10       17. The array processor of claim 16, wherein the processing elements of each cluster communicate to the North and West torus directions with one cluster and to the South and East torus directions with another cluster.

15       18. The array processor of claim 16, wherein the processing elements of each cluster communicate to the North and East torus directions with one cluster and to the South and West torus directions with another cluster.

20       19. The array processor of claim 16, wherein at least one cluster includes an  $N \times N$  torus transpose pair.

25       20. The array processor of claim 16, wherein a cluster switch comprises said multiplexers and said cluster switch is connected to multiplex communications received from two mutually exclusive torus directions to processing elements within a cluster.

30       21. The array processor of claim 20, wherein said cluster switch is connected to multiplex communications from the processing elements within a cluster for transmission to another cluster.

35       22. The array processor of claim 21, wherein said cluster switch is connected to multiplex communications between transpose processing elements within a cluster.

23. The array processor of claim 16, wherein N is greater than or equal to M.

24. The array processor of claim 16, wherein N is less than M.

5 25. An array processor, comprising:

10 N clusters of M processing elements, each processing element having a communications port through which the processing element transmits and receives data over a total of B wires and each processing element within a cluster being formed in closer physical proximity to other processing elements within a cluster than to processing elements outside the cluster;

15 communications paths which are less than or equal to (M) (B)-wires wide connected between pairs of said clusters, each cluster member in the pair containing processing elements which are torus nearest neighbors to processing elements in the other cluster of the pair, each path permitting communications between said cluster pairs in two mutually exclusive torus directions, that is, South and East or South and West or North and East or North and West; and

20 25 multiplexers connected to combine 2(M) (B)-wire wide communications into said less than or equal to (M) (B)-wires wide paths between said cluster pairs.

30 26. The array processor of claim 25, wherein the processing elements of each cluster communicate to the North and West torus directions with one cluster and to the South and East torus directions with another cluster.

35 27. The array processor of claim 25, wherein the processing elements of each cluster communicate to the North and East torus directions with one cluster and to the South and West torus directions with another cluster.

28. The array processor of claim 25, wherein at least one cluster includes an  $N \times N$  torus transpose pair.

5 29. The array processor of claim 25, wherein a cluster switch comprises said multiplexer and said cluster switch is connected to multiplex communications received from two mutually exclusive torus directions to processing elements within a cluster.

10 30. The array processor of claim 29 wherein said cluster switch is connected to multiplex communications from the processing elements within a cluster for transmission to another cluster.

15 31. The array processor of claim 30, wherein said cluster switch is connected to multiplex communications between transpose processing elements within a cluster.

20 32. The array processor of claim 25, wherein  $N$  is less than or equal to  $M$ .

33. The array processor of claim 25, wherein  $N$  is greater than  $M$ .

25 34. The array processor of claim 25, wherein communications between processing elements is bit-serial and each processing element cluster communicates with two other clusters over said communications paths.

30 35. The array processor of claim 25, wherein the communications paths between processing elements comprise a data bus.

35 36. The array processor of claim 25, wherein said communications paths are bidirectional paths.

37. The array processor of claim 25, wherein said communications paths comprise unidirectional signal wires.

5 38. The array processor of claim 25, wherein P and Q are the number of rows and columns, respectively, of a torus connected array having the same number of PEs as said array, and P and Q are equal to N and M, respectively.

10 39. The array processor of claim 25, wherein P and Q are the number of rows and columns, respectively, of a torus connected array having the same number of PEs and P and Q are equal to M and N, respectively.

15 40. An array processor, comprising:  
processing elements (PEs)  $PE_{i,j}$ , where i and j refer to the respective row and column PE positions within a conventional torus-connected array, and where  $i = 0, 1, 2, \dots, N-1$  and  $j = 0, 1, 2, \dots, N-1$ , said PEs arranged in clusters  $PE_{(i+a) \pmod{N}, (j+N-a) \pmod{N}}$ , for any  $i, j$  and for all  $a \in \{0, 1, \dots, N-1\}$ ; and

20 cluster switches connected to multiplex inter-PE communications paths between said clusters thereby providing inter-PE connectivity equivalent to that of a torus-connected array.

25 41. The array processor of claim 40, wherein said cluster switches are further connected to provide direct communications between PEs in a transpose PE pair within a cluster.

30 42. The array processor of claim 40, wherein said clusters are scaleable.

35 43. A method of forming an array processor, comprising the steps of:

arranging processing elements in N clusters of M

processing elements, such that each cluster includes processing elements which communicate only in mutually exclusive torus directions with the processing elements of at least one other cluster; and

- 5       multiplexing said mutually exclusive torus direction communications.

1  
2  
3  
4  
5  
6  
7  
8  
9  
10  
11  
12  
13  
14  
15  
16  
17  
18  
19  
20  
21  
22  
23  
24  
25  
26  
27  
28  
29  
30  
31  
32  
33  
34  
35  
36  
37  
38  
39  
40  
41  
42  
43  
44  
45  
46  
47  
48  
49  
50  
51  
52  
53  
54  
55  
56  
57  
58  
59  
60  
61  
62  
63  
64  
65  
66  
67  
68  
69  
70  
71  
72  
73  
74  
75  
76  
77  
78  
79  
80  
81  
82  
83  
84  
85  
86  
87  
88  
89  
90  
91  
92  
93  
94  
95  
96  
97  
98  
99  
100  
101  
102  
103  
104  
105  
106  
107  
108  
109  
110  
111  
112  
113  
114  
115  
116  
117  
118  
119  
120  
121  
122  
123  
124  
125  
126  
127  
128  
129  
130  
131  
132  
133  
134  
135  
136  
137  
138  
139  
140  
141  
142  
143  
144  
145  
146  
147  
148  
149  
150  
151  
152  
153  
154  
155  
156  
157  
158  
159  
160  
161  
162  
163  
164  
165  
166  
167  
168  
169  
170  
171  
172  
173  
174  
175  
176  
177  
178  
179  
180  
181  
182  
183  
184  
185  
186  
187  
188  
189  
190  
191  
192  
193  
194  
195  
196  
197  
198  
199  
200  
201  
202  
203  
204  
205  
206  
207  
208  
209  
210  
211  
212  
213  
214  
215  
216  
217  
218  
219  
220  
221  
222  
223  
224  
225  
226  
227  
228  
229  
230  
231  
232  
233  
234  
235  
236  
237  
238  
239  
240  
241  
242  
243  
244  
245  
246  
247  
248  
249  
250  
251  
252  
253  
254  
255  
256  
257  
258  
259  
260  
261  
262  
263  
264  
265  
266  
267  
268  
269  
270  
271  
272  
273  
274  
275  
276  
277  
278  
279  
280  
281  
282  
283  
284  
285  
286  
287  
288  
289  
290  
291  
292  
293  
294  
295  
296  
297  
298  
299  
300  
301  
302  
303  
304  
305  
306  
307  
308  
309  
310  
311  
312  
313  
314  
315  
316  
317  
318  
319  
320  
321  
322  
323  
324  
325  
326  
327  
328  
329  
330  
331  
332  
333  
334  
335  
336  
337  
338  
339  
340  
341  
342  
343  
344  
345  
346  
347  
348  
349  
350  
351  
352  
353  
354  
355  
356  
357  
358  
359  
360  
361  
362  
363  
364  
365  
366  
367  
368  
369  
370  
371  
372  
373  
374  
375  
376  
377  
378  
379  
380  
381  
382  
383  
384  
385  
386  
387  
388  
389  
390  
391  
392  
393  
394  
395  
396  
397  
398  
399  
400  
401  
402  
403  
404  
405  
406  
407  
408  
409  
410  
411  
412  
413  
414  
415  
416  
417  
418  
419  
420  
421  
422  
423  
424  
425  
426  
427  
428  
429  
430  
431  
432  
433  
434  
435  
436  
437  
438  
439  
440  
441  
442  
443  
444  
445  
446  
447  
448  
449  
450  
451  
452  
453  
454  
455  
456  
457  
458  
459  
460  
461  
462  
463  
464  
465  
466  
467  
468  
469  
470  
471  
472  
473  
474  
475  
476  
477  
478  
479  
480  
481  
482  
483  
484  
485  
486  
487  
488  
489  
490  
491  
492  
493  
494  
495  
496  
497  
498  
499  
500  
501  
502  
503  
504  
505  
506  
507  
508  
509  
510  
511  
512  
513  
514  
515  
516  
517  
518  
519  
520  
521  
522  
523  
524  
525  
526  
527  
528  
529  
530  
531  
532  
533  
534  
535  
536  
537  
538  
539  
540  
541  
542  
543  
544  
545  
546  
547  
548  
549  
550  
551  
552  
553  
554  
555  
556  
557  
558  
559  
560  
561  
562  
563  
564  
565  
566  
567  
568  
569  
570  
571  
572  
573  
574  
575  
576  
577  
578  
579  
580  
581  
582  
583  
584  
585  
586  
587  
588  
589  
590  
591  
592  
593  
594  
595  
596  
597  
598  
599  
600  
601  
602  
603  
604  
605  
606  
607  
608  
609  
610  
611  
612  
613  
614  
615  
616  
617  
618  
619  
620  
621  
622  
623  
624  
625  
626  
627  
628  
629  
630  
631  
632  
633  
634  
635  
636  
637  
638  
639  
640  
641  
642  
643  
644  
645  
646  
647  
648  
649  
650  
651  
652  
653  
654  
655  
656  
657  
658  
659  
660  
661  
662  
663  
664  
665  
666  
667  
668  
669  
670  
671  
672  
673  
674  
675  
676  
677  
678  
679  
680  
681  
682  
683  
684  
685  
686  
687  
688  
689  
690  
691  
692  
693  
694  
695  
696  
697  
698  
699  
700  
701  
702  
703  
704  
705  
706  
707  
708  
709  
710  
711  
712  
713  
714  
715  
716  
717  
718  
719  
720  
721  
722  
723  
724  
725  
726  
727  
728  
729  
730  
731  
732  
733  
734  
735  
736  
737  
738  
739  
740  
741  
742  
743  
744  
745  
746  
747  
748  
749  
750  
751  
752  
753  
754  
755  
756  
757  
758  
759  
760  
761  
762  
763  
764  
765  
766  
767  
768  
769  
770  
771  
772  
773  
774  
775  
776  
777  
778  
779  
770  
771  
772  
773  
774  
775  
776  
777  
778  
779  
780  
781  
782  
783  
784  
785  
786  
787  
788  
789  
790  
791  
792  
793  
794  
795  
796  
797  
798  
799  
800  
801  
802  
803  
804  
805  
806  
807  
808  
809  
810  
811  
812  
813  
814  
815  
816  
817  
818  
819  
820  
821  
822  
823  
824  
825  
826  
827  
828  
829  
830  
831  
832  
833  
834  
835  
836  
837  
838  
839  
840  
841  
842  
843  
844  
845  
846  
847  
848  
849  
850  
851  
852  
853  
854  
855  
856  
857  
858  
859  
860  
861  
862  
863  
864  
865  
866  
867  
868  
869  
870  
871  
872  
873  
874  
875  
876  
877  
878  
879  
880  
881  
882  
883  
884  
885  
886  
887  
888  
889  
880  
881  
882  
883  
884  
885  
886  
887  
888  
889  
890  
891  
892  
893  
894  
895  
896  
897  
898  
899  
900  
901  
902  
903  
904  
905  
906  
907  
908  
909  
910  
911  
912  
913  
914  
915  
916  
917  
918  
919  
920  
921  
922  
923  
924  
925  
926  
927  
928  
929  
930  
931  
932  
933  
934  
935  
936  
937  
938  
939  
940  
941  
942  
943  
944  
945  
946  
947  
948  
949  
950  
951  
952  
953  
954  
955  
956  
957  
958  
959  
960  
961  
962  
963  
964  
965  
966  
967  
968  
969  
970  
971  
972  
973  
974  
975  
976  
977  
978  
979  
980  
981  
982  
983  
984  
985  
986  
987  
988  
989  
980  
981  
982  
983  
984  
985  
986  
987  
988  
989  
990  
991  
992  
993  
994  
995  
996  
997  
998  
999  
1000